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1 Introduction 

The standard sum-product inequality, as developed by Erdos and Szemeredi 
[2], and improved upon by Elekes [I], Ford [3], Nathanson [I], and Solymosi 
[6] [7] , asserts that if A is a set of n real numbers (though Erdos and Szemeredi 
originally prove their theorem for ACZ), then 

\A + A\ + \A.A\ > n 1+£ , whenever n > n . 

It is interesting to consider whether one can prove a finer version of this 
theorem, where one is allowed to "perturb" the products ab G A. A a little 
bit: suppose that for each pair a,b G A we get to choose "perturbation 
parameters" 5 a ,b and 5' ab , where say 

5 a , b ,5' a:b G [-1,1]. 

Given these parameters, define the "perturbed product set" 

P := {(a + 5 a}b )(b + 5' aib ) : a,b G A}. 

Must it be the case that for all choices of S at b, S' a b we get that 

\A + A\ + \P\ > n l+£ ? 
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The answer is obviously 'no', since if the elements of A are close enough 
together, and are in arithmetic progression, we can easily choose S a ^ and S' a b 
for every c G A, so that 

\A+ A\ + \P\ = 2n. 

But what if we add the condition that the elements of A are all spaced at 
least 1 apart? Can we show that \A + A\ + \P\ must always be large? 

Again the answer is 'no', but the natural example is a little more compli- 
cated. Basically, consider the arithmetic progression 

A := {x + l,x + 2, ...,x + n}. (1) 

This set has the property that it is "close" to the geometric progression 

{x(l + l/xY : .7 = 0,1, ..,n-l}. (2) 

Indeed, 

x(l + l/x) j = x+j + 0(j 2 /x), 

for x > n. So, clearly, for x big enough relative to n we will have that if the 
&a,h, <5'ab are allowed to vary over [—1,1], we can force 

P = A + A, 

thereby making 

\A + A\ + \P\ < An -2. 

But we can give even better upper bounds on 5 a ,b and 5' ab that make 
\P\ + \A + A\ small: first, set all 5' ab = 0. Then suppose that a and b 
correspond to the numbers (|2J) using exponents j and k, respectively. Next, 
we choose 5 a ^ <C n/x so that 

{a + 5 a)b )b = x\l+j/x + 0(j 2 /x 2 ) + 5 a>b )(l + k/x + 0(k 2 /x 2 )) 
= x 2 (l + (j + k)/x + 5 a , b + 0(n 2 /x 2 )). 

So, if 6 a ,b is the negative of this 0{n 2 /x 2 ) < 0(n/x), then the product will 
be just x 2 (l + (j + k)/x), meaning that we can make our perturbed product 
set into an arithmetic progression, making \P\ + \A + A\ n. 

So, to achieve a lower bound on \P\ + \ A + A\ that is substantially better 
than the trivial bound, we will need that 5 a ^, S' a b can only vary over intervals 



2 



that are width 0(n/x). Our main theorem below will show that this is 
essentially best possible, as the intervals for 5 a ^ and 5' ab leading to non- 
trivial results will have lengths n 1 ~ £ /a and n 1 ~ £ /b, respectively. Actually, 
our theorem is even more general, since following Remark 2 at the end of our 
theorem, the sumset A + A can also be perturbed, and still we get a good 
lower bound on the resulting perturbed sum and product sets. 
Our main theorem is as follows: 

Theorem 1 Suppose < e < 1 and let A be a set of n > n (e) positive real 
numbers, all at least 1 apart. For each pair (a, b) e A x A, suppose that 5 a ^ 
and 6' b are arbitrary real numbers satisfying 

M < — , and |<^ b | < — . (3) 
Finally, define the perturbed product set 

P ■= {(a + 6 a , b )(b + 5' atb ) : a,beA}. 



Then, we have that 



\P\ + \A + A\ > 



n l+e/9 

logn ' 



Remark 1. Obviously, if the elements of A are not at least 1 apart, we can 
rescale to make it true. Furthermore, all we really need is that the median 
of the gaps between consecutive elements of A is at least 1, since by deleting 
at most n/2 elements from A we get a set of elements that are all at least 1 
apart. 

Remark 2. From the proof one can show that if we also perturb the sums 
A + A, we have the same quality lower bound on the sums and products; 
that is, suppose we define S to be the set of all perturbed sums a + b + S" >b , 
where \5" >b \ < n 1_e /(a + b). Then, we can show 

„l+e/9 

\P\ + \S\ » 4 . (4) 

log n 

Basically, the reason we can show this is that in the first parts of the proof, 
we pass to a subset B C. A, contained in a dyadic interval [x, 2x), whose 
set of perturbed products or sums B + B we show must contain at least 
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n 1+£ / 9 /3 logn elements. This then means that \8" b \ < n 1 ~ £ /x; and, then we 
can bound \S\ from below by a constant multiple of the set of sums of C + C , 
where C is the set of elements of B rounded to the nearest multiple of n l ~ £ jx. 
Rewriting the perturbed products for B in terms of perturbed products for 
C, it is easy to see that this implies (jl]). 



2 Proof of Theorem d 
2.1 Preliminaries 

We will basically follow a variant of Elekes's original argument used to prove 
that if A is a set of n reals, then 

\A + A\ ■ \A.A\ > n 5/2 , 

from which it follows that 

\A + A\ + \A.A\ > n 5/4 . 

But our approach will differ in that the Szemeredi- Trotter theorem [9j is not 
directly amenable to our particular approach. Instead, we apply a very minor 
generalization of the Szemeredi- Trotter curve theorem of Szekely [5] (hardly 
any generalization at all) , which follows by the same proof as that of Szekely. 

Theorem 2 Suppose that one has a collection of £ non-self-crossing curves 
and p points. Let C denote the collections of curves. Let mi denote the 
maximal number of curves that can pass through any given pair of points 
Pi,P2, and let m 2 denote the "average intersection multiplicity", defined as 
follows 

7712 := U) S l c i nc 2l- 

Then, the number of incidences I, which is the number of point-curve pairs, 
where the point is on the curve, satisfies 

I < (m 1 m 2 ) l/3 (p£) 2/3 + £ + mip. 
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The way this differs from the Szemeredi- Trotter curve theorem in [5] is 
that m 2 is the average intersection multiplicity among the curves, not an 
absolute upper limit on the intersection multiplicity between pairs of curves. 

The proof of Szekely begins with the following result on the crossing 
number cr(G) of a multigraph G, as appears in [8], Theorem 7]. 

Theorem 3 Suppose that G is a multigraph with n nodes, e edges and edge 
multiplicity m. Then, either e < 5nm or cr(G) > ce 3 /(n 2 m). 

Proof of Theorem [2J Now to prove Theorem [2] we construct a graph as 
follows: fix one of our i curves, and consider which of our n points happens 
to lie on it. By choosing a direction with which to traverse the curve, we 
create an ordering of these incident points. If there are x such points on the 
given curve, then we form x — 1 curve segments that adjoin consecutive pairs 
of points. We throw away the "infinite parts" of the curves as they will play 
no further role in our proof. 

Letting / denote our total number of incidences, and e the number of 
edges in our graph, we will have 



since the number of edges each curve contributes is one less than its number 
of incident points. Letting C denote our set of curves, we also have that 



Finally, note that in our drawing of the graph m = m\. 

Putting all this together, we either have that e < 5min, which would 
imply 



e = 



I -I 




I 



e + £ < hmin + £, 



which implies our theorem, or else 



(I - £f/(n 2 mi ) = e 3 /(n 2 mi ) < cr(G) < 




Theorem [2] is now proved. 
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2.2 Restricting to a dyadic interval 

Later, we will require an upper bound on the number of curves passing 
through pairs of grid points, and to achieve such upper bounds it will be 
good to first pass to elements of A that lie in a dyadic interval. To this end 
we will require the following lemma. 

Lemma 1 Suppose that A is a set of n real numbers satisfying \ A + A\ < 
n 1+5 /3 logn. Then, there exists a dyadic interval [x, 2x) containing at least 
n 1 ^ 5 elements of A. 

A version of this lemma can be proved without too much trouble us- 
ing only very elementary ideas; however, we give a proof using the Ruzsa- 
Plunnecke inequality, since it makes the proof short and transparent. First, 
let us state the Ruzsa-Plunnecke inequality [5]. 

Theorem 4 Suppose that A is a subset of an additive abelian group, such 
that 

\A + A\ < K\A\. 

Then, 

\kA-lA\ = \A + A-\ V A - A - A A\ < K k+e \A\. 



Proof of Lemma [JJ. Suppose, for proof by contraposition, that every dyadic 
interval contains fewer than n l ~ s elements of A. Then, it requires at least 
n s disjoint dyadic intervals to contain all the elements of A, and therefore 
choosing one element from every other dyadic interval (if these disjoint dyadic 
intervals are put into increasing order), we get a sequence of at least n <5 /2 
elements 

A' := {a 1 ,...,a k } C A, k > n s /2, 

such that 

CLi+l/di > 2. 

It is a simple matter to show that all the fc-fold sums of distinct elements 
of A' are distinct (think about the usual proof that binary number represen- 
tations are unique); and so, if we assume that \A + A\ = K\A\, then from 
Theorem H] 

{n s /k) k < ( n5 ] 2 \ < \kA'\ < \kA\ < K k n. 
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It follows that 

K > n s - 1/k /k. 
Choosing k ~ logn we get that 

K > n 5 /31ogn. 

But this means that 

\A + A\ = Kn > n 1+5 /3\ogn, 

so the lemma is proved. ■ 

Now we choose the dyadic interval [x, 2x) containing the most elements 
of A. We may assume that this interval contains at least n 1_K£ elements of 
A, where we define the constant 

k = 1/9, 

since otherwise Lemma [H with 5 = ke implies that \A + A\ > n 1+K£ /3 logn, 
thereby proving Theorem [TJ 

Let B denote those > n 1_K£ elements of A contained in [x, 2x). We note 
that if we consider the products 

(a + S ayb )(b + S' ab ), a,b e B, 

then as we vary over all legal choices for S a ,b and S' a b , since a and b lie in the 
same dyadic interval [x, 2x), we have that the possible values of this product 
must lie in an interval of width at most 

2n l - £ + n 2 - 2£ /x 2 . 

Since the gap between elements of A is at least 1, we have x ^> n, making 
our interval of width 

< A := n l ~ e . 

So, the number of our "perturbed products" is at least a constant multiple 
of the set of distinct values < ab >, a,b e B, where < t > denotes t rounded 
to the nearest multiple of A. 
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2.3 A family of polygonal curves 

In order to apply this theorem, we need to define some points and curves 
that are relevant to our problem: we begin by letting X be the elements of 
B + B; and then we let Y be the elements of B.B rounded to the nearest 
multiple of A. Our set of points will then be X x Y; so, there will be \X\ ■ \Y\ 
of them in total. 

Now we define the curves: we begin by considering, for each a,b G B, the 
set of points on the line 

y = a(x-b), 

with x G B + b. Then, we round the y coordinate to the nearest multiple of 
A. Sweeping from left-to-right across the grid, we connect consecutive points 
by line segments. 

2.4 Perturbing the curves 

At this point we could have that some pairs of curves intersect in a segment, 
and therefore have infinitely many points of intersection. But this is easily 
fixed by making a small perturbation to the curves, replacing the shared 
segments by closely drawn curves that are nearly parallel and only intersect 
at grid points. We furthermore can assume that if a pair of points is common 
to two or more curves, then those points must be grid points (again, by 
perturbing the curves slightly, while still connecting the same grid points as 
before). 

2.5 The average crossing number 

Now we calculate the average intersection multiplicity of pairs of curves: first, 
we observe that the polygonal arcs are at most a vertical distance A from 
their corresponding straight lines. And therefore, two of these polygonal arcs, 
corresponding, say, to the curves 

y = a(x — b) and y = c(x — d), c> a, 

can only cross if x is such that 

\c(x -d)-a(x-b)\ < 2 A. 

In other words, 

|(c - a)x + ab - cd\ < 2n 1 ~ £ . 
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Furthermore, between any consecutive x-values of the set 



{b,c} + B, (5) 

there can be at most one crossing between the pair of polygonal arcs. 

Now, since in any three consecutive points of (jHJ), two must either belong 
to b + B or to c + B, we have that every other point of (jSJ) is at least 1 apart. 
It follows, therefore, that the number of crossings between the two polygonal 
arcs is at most 

1 + An 1 ~ e /(c-a). 

(The 1 here accounts for the "left-most point of intersection", and once we 
are given this point, there can be at most 4n 1 ~ E /(c — a) other intersection 
points to the right of it.) Thus, the average intersection multiplicity, is easily 
seen to be bounded from above by 

2n x - e v-^ 1 2 



n 2 ' c — a n 1+£ ^— ' (j — i) 

" cEB l<i<j<n w ' 

>a 

< l 21ogn 



2.6 The number of curves meeting in a pair of points 

Next, we need to produce an upper bound on the number of curves that can 
meet in a pair of points. We begin by noting, by the way we perturbed the 
curves in subsection 12.41 that in order for two or more curves to meet in a 
pair of points, those points must both be elements of B + B\ furthermore, if 
any of such curves corresponds, say, to a line y = a(x — b), then that pair of 
points must lie in B + b. And so, the pair of points must be at least 1 apart. 

It is easy to see, then, that the curves meeting in a pair of points, 
(xi,yi), (x2,y2), X2 > xi + 1, are at most in number the set of lines of the 
form 

fi(x) = Oi(x-bi), 

where all the fi(xi) come within A of one another, and the same should hold 
for fi(x2)- But this implies that 

\(a i -a j )(x 1 -x 2 )\ < 2 A. (6) 
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First, let us see that no two of these lines can have the same slope: if 
they did, say a, = aj, then just considering the contribution of the point with 
x — Xi, we would have that 

di(xi - bi) = aj(xi - bj), 

and therefore bi = bj. But this can only hold if the two lines are in fact that 
same, so we may assume the slopes are different. Assuming this, we find 
from that 

k-%| < 2A = 2n 1 ~ e . 

It is clear that, since the G B are all at least 1 apart, there can be at most 
0(n 1_£ ) choices for the slopes such that all pairwise differences \ai — aj\ 
satisfy this bound. 

We have therefore proved that 

mi <C n 1 ^ 6 . 

Here we are assuming that e < 1, because of course we know that mi > 1. 
2.7 Conclusion of the proof 

Putting everything together, since our \B\ 2 = n 2 ~ 2K£ lines hit the grid X xY 
in \B\ = n l ~ KS points each, we have that the number of incidences is n 3 ~ 3K£ . 
Yet , from Theorem [2] we find that the number of incidences is 

< { mi m 2 ) l l\\X\-\Y\-n 2 - 2Ke ) 2 lz < n^-^dXl-lFl-n 2 - 2 -) 2 / 3 . 

So, 

I -XI ■ \Y\ > n 2 - £ ( 5 */2-l/2) 

So, 

| X | + \Y\ » n l+-(V4-5«/4) ^ n l+e/^ 

Note that here is where we used the fact that e is sufficiently small - it 
allowed us to ignore the contribution of the terms I + rriip. 
This completes the proof. 
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